Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition

ثبت نشده
چکیده

Recently, there has been a lot of interest in automatically generating descriptions for an image. Most existing language-model based approaches for this task learn to generate an image description word by word in its original word order. However, for humans, it is more natural to locate the objects and their relationships first, and then elaborate on each object, describing notable attributes. We present a coarse-to-fine method that decomposes the original image description into a skeleton sentence and its attributes, and generates the skeleton sentence and attribute phrases separately. By this decomposition, our method can generate more accurate and novel descriptions than the previous state-of-the-art. Experimental results on the MSCOCO and a larger scale Stock3M datasets show that our algorithm yields consistent improvements across different evaluation metrics, especially on the SPICE metric, which has much higher correlation with human ratings than the conventional metrics. Furthermore, our algorithm can generate descriptions with varied length benefiting from the separate control of the skeleton and attributes. This enables image description generation that better accommodates user preferences.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Morphological skeleton representation and coding of binary images

This paper presents the results of a study on the use of morphological set operations to represent and encode a discrete binary image by parts of its skeleton, a thinned version of the image containing complete information about its shape and size. Using morphological erosions and openings, a finite image can be uniquely decomposed into a finite number of skeleton subsets and then the image can...

متن کامل

Error Free Iterative Morphological Decomposition Algorithm for Shape Representation

Problem statement: A generalized skeleton transform allows a shape to be represented as a collection of modestly overlapped octagonal shape parts. One problem with several generalized Morphological skeleton transforms is that they generate noise after decomposition. The noise rate may not be effective for ordinary images; however this effect will be more when applied on printed or handwritten c...

متن کامل

2D Shape Decomposition Based on Combined Skeleton-Boundary Features

Decomposing a shape into meaningful components plays a strong role in shape-related applications. In this paper, we combine properties of skeleton and boundary to implement a general shape decomposition approach. It is motivated by recent studies in visual human perception discussing the importance of certain shape boundary features as well as features of the shape area; it utilizes certain pro...

متن کامل

Identification of Curvature Features with Use of Boundary-Skeleton Model of Image

A method of identification of curvature features for shape description and recognition is suggested. The method is based on the relation between local boundary features and the skeleton structure of a plain domain. It uses a continuous boundary-skeleton model of an image. The model consists of a boundary of a polygonal figure approximating a raster image, and a skeleton of the figure. To reveal...

متن کامل

From skeleton branches to object parts

A method to decompose a 3D object into simple parts starting from its curve skeleton is described. Branches of the curve skeleton are classified as meaningful and non-meaningful by using the notion of zone of influence of the points where skeleton branches meet. Meaningful branches are associated to subsets of the object, which are obtained by subtracting from the input object suitably expanded...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016